Weka-GDPM – Integrating Classical Data Mining Toolkit to Geographic Information Systems
نویسندگان
چکیده
Geographic data preprocessing is the most effort and time consuming step in spatial data mining. In order to facilitate geographic data preprocessing and increase the practice of spatial data mining, this paper presents Weka-GDPM, an interoperable module that supports automatic geographic data preprocessing for spatial data mining. GDPM is implemented into Weka, which is a free and open source classical data mining toolkit that has been widely used in academic institutions. GDPM follows the Open GIS specifications to support interoperability with Geographic Information Systems. It automatically generates data at two granularity levels without using prior knowledge and provides support for both distance and topological spatial relationships.
منابع مشابه
A Reuse-based Spatial Data Preparation Framework for Data Mining
The constant increase in use of geographic data in different application domains has resulted in large amounts of data stored in spatial databases and in the desire of data mining. Many solutions for spatial data mining have been proposed. Most create data mining languages or extend existing query languages to support data mining operations. This paper presents an interoperable framework for sp...
متن کاملWeka-STPM: from trajectory samples to semantic trajectories
Enormous quantities of trajectory data are collected from many sources, as GPS devices and mobile phones, as sequences of points. These data can be used in many application domains such as traffic management, urban planing, tourism, and bird migration. However, in most applications a higher level of abstraction should be used instead of sample points. In this paper we present an extension of th...
متن کاملGeoSTAT - A System for Visualization, Analysis and Clustering of Distributed Spatiotemporal Data
Nowadays, there is a considerable amount of spatiotemporal data available on the web. The visualization of these data requires several visual resources which helps users to have a correct interpretation of the data set. Furthermore, the use of data mining algorithms has proven relevant in helping the exploratory analysis of spatiotemporal data. This paper proposes the GeoSTAT (GEOgraphic Spatio...
متن کاملWeka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids
This paper presents Weka4WS, a framework that extends the Weka toolkit for supporting distributed data mining on Grid environments. Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for accessing remote data mining algorithms and managing distributed computations. The Weka4WS user interface is a modified Weka Explorer environment that supports the execution of both local and re...
متن کاملWSRF Services for Composing Distributed Data Mining Applications on Grids: Functionality and Performance
The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid applications. WSRF can be exploited for developing high-level services for distributed data mining applications. This paper describes Weka4WS, a framework that extends the widely-used Weka toolkit for supporting distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF tec...
متن کامل